A Bayesian approach to joint modeling of protein-DNA binding, gene expression and sequence data.

نویسندگان

  • Yang Xie
  • Wei Pan
  • Kyeong S Jeong
  • Guanghua Xiao
  • Arkady B Khodursky
چکیده

The genome-wide DNA-protein-binding data, DNA sequence data and gene expression data represent complementary means to deciphering global and local transcriptional regulatory circuits. Combining these different types of data can not only improve the statistical power, but also provide a more comprehensive picture of gene regulation. In this paper, we propose a novel statistical model to augment protein-DNA-binding data with gene expression and DNA sequence data when available. We specify a hierarchical Bayes model and use Markov chain Monte Carlo simulations to draw inferences. Both simulation studies and an analysis of an experimental data set show that the proposed joint modeling method can significantly improve the specificity and sensitivity of identifying target genes as compared with conventional approaches relying on a single data source.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian Joint Modeling of Multiple Gene Networks and Diverse Genomic Data to Identify Target Genes of a Transcription Factor.

We consider integrative modeling of multiple gene networks and diverse genomic data, including protein-DNA binding, gene expression and DNA sequence data, to accurately identify the regulatory target genes of a transcription factor (TF). Rather than treating all the genes equally and independently a priori in existing joint modeling approaches, we incorporate the biological prior knowledge that...

متن کامل

Gamma reactivation using the spongy effect of KLF1-binding site sequence: an approach in gene therapy for beta-thalassemia

Objective(s): β-thalassemia is one of the most common genetic disorders in the world. As one of the promising treatment strategies, fetal hemoglobin (Hb F) can be induced. The present study was an attempt to reactivate the γ-globin gene by introducing a gene construct containing KLF1 binding sites to the K562 cell line. Materials and Methods: A plasmid containing a 192 bp sequence with two repe...

متن کامل

Design and Production of Recombinant TAT Protein Structure, Catalytic Domain of Diphtheria Toxin, and Evaluation of Its Effect on Cell Line

Background and Objectives: Cancer is one of the most deadly diseases in the present age and its conventional therapies have had low success. Toxin therapy of cancer is a new therapeutic approach, which has attracted the attention of pharmaceutical specialists. Diphtheria toxin consists of three functional, transducing, and binding domains, that the functional part inhibits protein synthesis and...

متن کامل

Heterologous Expression of the Secale cereal Thaumatin-Like Protein in Transgenic Canola Plants Enhances Resistance to Stem Rot Disease

Canola (Brassica napus L.) is an important oilseed crop. A serious problem in cultivation of this crop andyield loss, are due to fungal disease stem rot caused by Sclerotinia sclerotiorum. The pathogenesis-related(PR) proteins have the potential for enhancing resistance against fungal pathogen. Thaumatin-like proteins(TLPs) have been shown to have antifungal activity on variou...

متن کامل

CLONING AND EXPRESSION OF LEISHMANOLYSIN GENE FROM LEISHMANIA MAJOR IN PRIMATE CELL LINES

Leishmanolysin is a worldwide disease that is caused by different species of the genus Leishmania. Leishmanolysin, One of the genes expressed by Leishmania, appears to be an ideal candidate for genetic vaccination. In this study, a full length sequence, which encodes Leishmanolysin functionally critical regions (amino acids 100-579), was cloned from a Leishmania strain endemic to Iran. Analysis...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Statistics in medicine

دوره 29 4  شماره 

صفحات  -

تاریخ انتشار 2010